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Description 



METHOD AND SYSTEM FOR OPTIMIZED INSTRUCTION 
FETCH TO PROTECT AGAINST SOFT AND HARD ERRORS 



BACKGROUND OF INVENTION 

[0001] This invention relates to the field of computerized data processing and, in particular, 
to a method and system for detecting error and implementing error correction during 
transfer of instructions from memory to a computer processor. 

[0002] Embedded systems typically have a computer processor and an attached instruction 
memory unit that feeds instructions to be executed by the processor. The instruction 
unit must minimize instruction access time by providing a fast path to stored code. If 
instructions are not provided at the rate in which they are executed, the processor will 
stall, and precious cycles will be wasted waiting for code. 

[0003] As embedded computer memory scales to smaller and smaller sizes, soft error rates 
have increased significantly. This is due in major part to the ever-increasing 
integration factor and decreasing feature dimensions. Unfortunately, soft errors can 
cause major problems in embedded systems. One example of particular concern is 
executing an incorrect instruction. If a soft error mutates an instruction, problems 
running the gamut from corrupted data to system failure can arise. None of this is 
acceptable in most applications; Even worse yet, the error may not be immediately 
obvious, making detection and correction more difficult. 

[0004] An error correcting code (ECC) technique is generally used to improve data integrity. 
ECC involves adding a number of check bits to a data word in order to detect and 
correct one or more bits that have flipped while the data was stored. Unfortunately, 
ECC generally carries a penalty with its use. The data word and check bits must be 
decoded to determine if correction is needed and which bit to correct. Typically, one 



or more additional clock cycles are required in order to accomplish this correction, 
and ECC implementations generally suffer this performance penalty. However, since 
data integrity is crucial, the tradeoff in performance is made. 
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[0005] An example of a prior art ECC mechanism for accessing instructions can be found in 
U.S. Patent No. 6,108,753, which employs an automatic retry when an ECC or parity 
error is detected. Unfortunately, this method may suffer a performance penalty 
because the processor cannot assume it will have a valid instruction in all cases. 
Therefore, the processor must wait for an error determination before deciding 
whether or not it can proceed. 

[0006] In the error detection and correction system of U.S. Patent No. 4,646,31 2, a parity 
error signal pauses the processor clock when asserted. During this pause, ECC 
correction is done on correctable data, and the processor clock is restarted when 
corrected data is available. Though this method improves performance execution over 
waiting for corrected data each cycle, the error signal must be able to halt clocks 
before an invalid instruction is executed. However, adding the ability to halt clocks to 
a processor can seriously limit pipelining within the processor, translating to extended 
latency and lower performance. Further, the delay from halting the clock during ECC 
correction of a corrupted instruction is experienced regardless of whether or not the 
instruction in question is actually executed. This is due to the fact that the delay 
occurs during retrieval. 

SUMMARY OF INVENTION 

[0007] Bearing in mind the problems and deficiencies of the prior art, it is therefore an object 
of the present invention to provide an improved method and system for detecting 
error during transfer of data signals from a data memory, particularly instructions 
from an instruction memory, to a computer processor. 

[0008] It is another object of the present invention to provide a method and system for 

detecting error during transfer of data signals from a data memory that maximizes 
data integrity. 

[0009] A further object of the invention is to provide a method and system for detecting error 
during transfer of data signals from a data memory that permits correction of the 
error with fewer penalties to performance of the processor. 

[0010] 

The above and other objects, which will be apparent to those skilled in art, are 
achieved in the present invention which is directed to a method of detecting error 
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during transfer of data signals from a data memory to a computer processor 
comprising initially commencing transmission of a raw data signal from a data 
memory to a computer processor, wherein the raw data signal includes an error 
detection code. At the time of the commencement of transmission of the raw data 
signal from the data memory to the computer processor, the method then includes 
checking the raw data signal for corruption based on its error detection code. If the 
data has not been corrupted, the method includes completing transmission of the raw 
data signal to the computer processor. However, if the error detection code indicates 
data corruption, the method then includes substituting the raw data signal with a 
predetermined reserved data signal and transmitting the predetermined reserved 
signal to the computer processor. Further, the method includes processing the raw 
data signal or the reserved signal with the computer processor. If the error detection 
code indicates data corruption, the method may further include determining if the 
corrupted data in the original raw data signal may be corrected, and subsequently 
retrieving the corrected data and processing the corrected data signal with the 
computer processor. 

[001 1] If the computer processor processes the predetermined reserved signal, the method 
may also include determining whether corrupted data in the raw data signal has been 
corrected, and, if corrected, subsequently retrieving the corrected data and processing 
the corrected data signal with the computer processor. If the corrupted data in the raw 
data signal has been corrected, the raw data signal in the data memory may be 
replaced with the corrected raw data signal. 

[001 2] If the computer processor processes a predetermined reserved instruction, the 

computer processor may execute an error handling routine comprising determining 
whether corrupted data in the raw data signal has been corrected* and, if corrected, 
retrieving the corrected data signal, processing the corrected data signal with the 
computer processor, and replacing the raw data signal in the data memory with the 
corrected data signal. 

[001 3] Preferably the computer processor operates on timed, uniform clock cycles, and the 
steps of transmitting the raw data signal from the data memory to a computer 
processor, simultaneously checking the raw data signal for corruption, and 
transmitting either the raw or predetermined reserved signal to the computer 
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processor, are all performed within a single clock cycle. 

[001 4] The method further preferably includes commencing transmission of a subsequent 
raw data signal from the data memory to the computer processor and repeating the 
aforementioned steps for the subsequent raw data signal, until all desired raw data 
signals from the data memory are processed by the computer processor. 

[001 5] The checking of the raw data signal for presence of data corruption is preferably 

performed simultaneously with the commencement of transmission of the raw data 
signal from the data memory to the computer processor. 

[001 6] Preferably, the error detection code may comprise ECC code, and the raw data signal 
may be a multi-bit data signal. More preferably, the raw data signal is an instruction 
from a random access memory associated with the computer processor. 

[001 7] In another aspect, the present invention is directed to a program storage device 

readable by a machine, tangibly embodying a program of instructions executable by 
the machine to perform the aforementioned method steps for detecting error during 
transfer of raw data signals from a data memory to a computer processor. 

[0018] 

In a further aspect, the present invention is directed to a system for detecting error 
during transfer of data signals from a data memory to a computer processor 
comprising a computer processor and a data memory device containing raw data to be 
processed by the computer processor, wherein the raw data being in the form of a raw 
data signal including an error detection code. The system further includes a data 
checker device adapted to check the raw data for presence of data corruption and, if 
the error detection code indicates data corruption, determining if the corrupted data 
in the original raw data may be corrected. The system also includes a computer 
processor instruction unit. The instruction unit is adapted to commence transmission 
of a raw data from the data memory device to the computer processor and cause the 
data checker device to check the raw data signal for corruption via decode of the data 
and error detection code. The instruction unit is further adapted to cause completion 
of transmission of the raw data to the computer processor if the error detection code 
indicates no data corruption or, if the error detection code indicates data corruption, 
cause substitution of the raw data with a predetermined reserved instruction and 
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transmission of the predetermined reserved instruction to the computer processor. 

[001 9] The computer processor, upon processing a predetermined reserved instruction, may 
be adapted to execute an error handling routine comprising determining whether 
corrupted data in the raw data has been corrected, and, if corrected, retrieving the 
corrected data, processing the corrected data with the computer processor, and 
replacing the raw data in the data memory device with corrected data. 

[0020] The computer processor may include a clock, such that the processor adapted to 
operate on timed, uniform clock cycles produced by the clock. Preferably, the 
computer processor instruction unit is adapted to cause transmission of the raw data 
from the data memory device to the computer processor, the data checker device is 
adapted to check simultaneously the raw data for presence of data corruption, and the 
computer processor instruction unit is adapted to transmit either the raw or 
predetermined reserved instruction to the computer processor, within every a cycle. 

[0021] The system may further include a corrected data register adapted to receive raw data 
corrected by the data checker device and transmit corrected raw data to the computer 
processor. The data memory is preferably a random access memory associated with 
the computer processor and the raw data is preferably an instruction from the random 
access memory for the computer processor. 

BRIEF DESCRIPTION OF DRAWINGS 

[0022] The features of the invention believed to be novel and the elements characteristic of 
the invention are set forth with particularity in the appended claims. The figures are 
for illustration purposes only and are not drawn to scale. The invention itself, 
however, both as to organization and method of operation, may best be understood 
by reference to the detailed description which follows taken in conjunction with the 
accompanying drawings in which: 

[0023] Fig. 1 is a schematic view of the preferred instruction memory controller of the 
present invention. 

[0024] Fig. 2 is a schematic view of the instruction memory controller of Fig. 1 showing 
additional preferred components in accordance with the present invention. 

DETAILED DESCRIPTION .. - 
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[0025] In describing the preferred embodiment of the present invention, reference will be 
made herein to Figs. 1 and 2 of the drawings in which like numerals refer to like 
features of the invention. 

[0026] In its preferred embodiment, the present invention provides a method and apparatus 
to allow an instruction on-chip memory unit (l-OCM) associated with a computer 
processor to implement error correcting code (ECC) to maximize data integrity while 
minimizing performance overhead. An example of a computer processor is the 
PowerPC 405 manufactured by the assignee, International Business Machines 
Corporation of Armonk, NY. The term data is to be understood to include instructions 
to be executed by the computer processor, The invention implements ECC code to 
protect against soft and hard errors while providing instructions in a timely manner to 
avoid stalling the processor operation. When an error is detected, the unit replaces the 
corrupt instruction on the fly. This will then allow the processor to retrieve the 
corrected instruction in a subsequent operation. Thus the processor only incurs a 
performance penalty when an error is actually encountered rather than on every 
memory access, and improves the classic data integrity vs. performance penalty 
trade-off. 

[0027] 



As shown in Fig. 1 , an instruction side on-chip memory (l-OCM) unit 1 0 provides instructions t 
processor unit (CPU) 12 via the processor's l-OCM interface 11. The purpose of the l-OCM unit 
CPU instruction access time by providing a fast path to stored code. The l-OCM also provides tl 
code to be read from other data sources over the processor local bus (PLB) slave 1 8 and to be i 
fly, thus enabling dynamic error detection and correction. The corrected data register (DCR) 14 
instructions or other data that has been corrected. As is standard, CPU 1 2 includes a clock, anc 
operates on timed, uniform clock cycles produced by the clock. Clock 16 and reset 20 inputs ai 
OCM controller 10. 
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The interface signals shown in Fig. 1 are described below in Table 1 : 



Table 1 



Signal Name 


I/O 


Interface 


i Description 










CPUjsocmReqPending 


IN 


CPU 


One of a set of signals which In 
various combinations indicate a 
processor request for one or two 
32-bit instructions. It is asserted by 
the processor to indicate that a 
fetch request is pending in the 
CPU. 


CPU isocm leu Ready 


IN 




Another in the set of signals used to 
generate a fetch request from the 
processor. When asserted with 
ReqPending, a new fetch request is 
being presented. 


CPU isocmABus(0:29) 


IN 




Address of the instruction(s) being 
requested by the CPU. 


CPU-isocmXIateValid 


IN 




Part of the group of signals used by 
the processor to request an 
instruction fetch. This signal 
normally has an additional function 
to validate storage attributes, 
however, that function is not being 
used in this design. 


CPUjsocmAbort 


IN 




Indicates that the currently 
outstanding fetch request is being 
aborted. No RdDValid bits may be 
asserted for the associated request 
after the cycle in which the Abort is 
asserted, If ReqPending is asserted 
along with Abort, a new fetch 
request is being presented. 


ISOCM<puHold 


OUT 




The Hold signal is asserted by the 
ISOCM to indicate to the processor 
that it is working on its request, but 
unable to complete the transfer 
during the current cycle. 


ISOCM-cpuRdDValid(0:1) 


OUT 




Validates data words presented to 
the processor on RdDBus(0:63) 
ines. 

Bit 0: Even word data valid an 
RdDBus (0:31) 

Bit 1 : Odd word data valid on 
RdDBus (32:63) 


ISOCM<puRdDBus(0:63) 


OUT 




64-bit data bus used to transfer 
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I instructions to the CPU. 


SLV isocmRd Req 


IN 


PLB 


I Read request signal 


SLV isocmWr Req 


IN 




| Write request signal 


SLV isocmRd Abus(0:31) 


IN 




Read address 


SLV isocmWr Abus(0:31) 


! IN 




I Write address 


SLV isocmRd BE(0:3) 


IN 




I Read byte enables 


SLV isocmWr BE(0:3) 


IN 




I Write byte enables 


SLV isocmRd Nwords(0:3) 


IN 




| Read number of words 


SLV isocmWr Nwords(0:3) 


IN 




I Write number of words 


SLV isocmWr Data(0:31) 


IN 




Write data bus 


SLV_isocmSampleCycleL2 


IN 




I PLB Sample Cvcle to synchronize 
PLB activity when running at higher 
frequencies 


ISOCM sIvRd Ack 


OUT 




1 Read Acknowledge 


ISOCM slvWr Ack 


OUT 




1 Write acknowledge 


ISOCM sIvRd Data(0:31) 


OUT 




1 Read data bus 


ISOCM sIvTimeout R 


OUT 




. Array access error detected on read 


ISOCM sIvTimeout W 


OUT 




Array access error detected on write 


ISOCM sIvTargetAdr 


OUT 




Target Address of read data 


PGM dcrAddr(0:5) 


IN 


DCR Bus 


rru^TctmindPie ulk, unii select 


CPU dcrRead 


IN 




Indicates that the CPU is requesting 
to read data 


CPU-daWrite 


IN 




Indicates that the CPU is requesting 
to wnie uata 


-CPU_dcrABus(0:9) 


IN 




Indicates the address of the DCR 
which is being accesses for a read 
or write DCR operation 


XXX_dcrData(0:31) 


IN 




The 32-bit data bus output to 
transfer write data from the CPU. 
This bus also drives 0x00000000 
during DCR read operations 


ISOCM_daAck 


OUT 




Indicates that the DCR data is 
available on 

ISOCM_cpuDBusin(read), or that 
the new DCR data is latched (write) 


ISOCM-dcrData(0:31) 


OUT 


I r 


The 32-bit data bus used to transfer 
ead data from a DCR to the CPU 
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[0028] Fig. 2 shows details of the preferred system, wherein l-OCM unit 10 contains a large 
static random access memory (SRAM) 24 or other data memory device for instruction 
or other data code storage which is accessible for reads from CPU 1 2, and for both 
reads and writes over PLB 18. The l-OCM control unit 10 supplies instructions from 
the instruction SRAM 24 to CPU 1 2, via fetches over the CPU l-OCM interface 1 1 . 
Normally, these instructions are the raw, uncorrected output of the SRAM, which 
provides the best performance path. 

[0029] CPU transfers require valid requests before CPU instruction fetches may be made. 
During the CPU instruction fetches, the preferred CPU l-OCM interface 1 1 handles 
data as 64 or 32 bit transfers. The 72 bit SRAM data word width includes 64 data bits 
and 8 ECC check bits. In normal operation, raw, uncorrected SRAM data (minus the 
check bits) are transferred during a CPU access in order to minimize the number of 
cycles required for an instruction fetch. While SRAM 24 is retrieving the data, the I- 
OCM unit 10 signals its intention to provide the instruction to CPU 12 in the next 
cycle, to minimize the instruction access time. When the data is available from the 
SRAM 24, the instruction unit causes transmission of the raw data 29 from SRAM 24 to 
the MUX 28 and error correcting code, generation checking (ECC CEN/CK) unit 26 to 
test for data corruption, for example due to a soft error in SRAM 24. However, if the 
data is corrupt, there is not enough time to correct it, since the system is preferably 
optimized for the non-error case. 

[0030] In accordance with the present invention, l-OCM unit 1 0 continues to provide an 
instruction within the same CPU clock cycle, even when the instruction read from 
SRAM 24 is corrupt. If an ECC error is detected in the data fetched from the SRAM, 
instead of passing on the corrupt instruction, l-OCM unit 10 swaps in a 
predetermined error-indicating signal or instruction 30 in place of the retrieved 
corrupted data, and presents it to CPU 1 2 via _the multiplexer (MUX) 28, also within 
the same CPU clock cycle. The predetermined error-indicating signal is a properly 
formed but reserved instruction or operation code. An attempt is also made at the 
same time by ECC GEN/CK 26 to determine if the error is correctable, and then to 
correct the corrupted data by known correction techniques. Any corrected data 31 
which has been processed by ECC GEN/CK 26 will then be stored in the l-OCM 
corrected data register (DCR) 14. 
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[0031] If and when computer processor 12 attempts to execute the predetermined reserved 
error-indicating signal or reserved op-code 30, an interrupt is taken. The interrupt 
handler preferably then proceeds as follows with an error handling routine: 

[0032] 1 . Determine if the error was correctable by reading the l-OCM status register. 

[0033] 2. If the error was correctable, read the corrected data or instruction 31 from the I- 
OCM corrected data register (DCR) 14 via the DCR bus or interface 32. Then write the 
corrected data or instruction 31 back to SRAM 24, to substitute for the corrupted raw 
data 29, via a PLB 1 8 write (discussed further below). 

[0034] 3. If the error was not correctable, retrieve the correct original data or instruction from 
a back-up memory area attached to the PLB. 

[0035] 4. Continue CPU 12 processing with the corrected instruction. 

[0036] The predetermined error-indicating signal that is substituted for the corrupt data 
signal or instruction therefore serves several purposes. First, it prevents a protocol 
violation on l-OCM interface 1 1 by providing an instruction when one was promised. 
Second, if the instruction is never executed, due to an immediately preceding jump 
instruction, there is no delay in continued processor execution, though the SRAM may 
still be corrected later if desired. Finally, when the substituted instruction is executed, 
it preferably causes processor 12 to execute an error handling routine which can 
retrieve the corrected instruction stored by l-OCM unit 10, update SRAM 24, and 
continue execution of the instructions. Though this may take multiple cycles, the 
performance penalty is only paid when an error is encountered. Amortized over 
millions of instructions, the penalty for vastly improved data integrity is minimized. 

[0037] For PLB-originating writes of the corrected data or instruction, l-OCM controller 10 
provides the capability to load code into SRAM 24 via PLB 1 8. However, since the PLB 
Slave 1 8 interface is only 32 bits, each of these writes must actually be a read- 
modify-write to the 72 bit wide ECC-protected SRAM. The read-modify-write will 
basically be transparent to the PLB interface. l-OCM controller 1 0 may perform all of 
the steps necessary for the write as follows: 

[0038] 1 . Latch in the PLB address bus and 32 bit write data. 
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[0039] 2. Wait until any currently executing l-OCM interface 1 1 request has completed. 

[0040] 3. Initiate an SRAM read of raw data 29 from the specified address. 

[0041] 4. Wait for ECC-corrected data 31 to be generated and stored in the corrected data 
register, DCR 14. 

[0042] 5. Using switch 34, combine the 32 bit write data from PLB 1 8 with the 

complementary half of the corrected data 31 from corrected data register, DCR 14, via 
latch 36. 

[0043] 6. Initiate an SRAM write, which will regenerate the ECC check bits and store the new 
72 bit double word into the SRAM. 

[0044] The read-modify-write operation will likely take multiple CPU clock cycles, so 
subsequent PLB accesses to the instruction side of SRAM 24 may be delayed, 

[0045] A PLB read of data or instruction from instruction side of SRAM 24 is somewhat 

simpler than a write, requiring l-OCM controller 10 to perform the following steps: 

[0046] 1 . Latch in the PLB address bus. 

[0047] 2. Wait until any currently executing l-OCM interface 1 1 request has completed. 
[0048] 3. Initiate an SRAM read of raw data 29 from the specified address. 
[0049] 4. Wait for ECC-corrected data 31 to be generated. 

[0050] 5. Choose the upper or lower half of the corrected data via MUX 38 depending on the 
requested address. 

[0051] A PLB read from l-SRAM will likely take multiple CPU cycles to complete. 
[0052] 

In the ECC operations, l-OCM controller 1 0 preferably utilizes a (72,64) Error 
Correcting Code to improve the reliability of the Instruction SRAM. Eight check bits are 
appended to the 64 data bits to provide single error correction and double error 
detection. All occurrences of both correctable (single bit) and uncorrectable (multiple 
bit) errors will cause the appropriate status / interrupt bits to be set in the l-OCM 
status/interrupt register. The l-OCM Interrupt will be asserted only when an 
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uncorrectable error has been detected as a result of either a PLB or OCM interface 
request. 

[0053] Thus, the present invention to provide an improved method and system for detecting 
instruction error during transfer of data signals from a data memory to a computer 
processor. The invention permits the processor to proceed whether or not an error is 
detected in the accessed instruction, and the processor is only delayed when an error- 
indicating signal or instruction is actually executed. The method and system of the 
present invention therefore maximizes data integrity while permitting correction of 
the error with fewer penalties to performance of the processor. 

[0054] While the present invention has been particularly described, in conjunction with a 

specific preferred embodiment, it is evident that many alternatives, modifications and 
variations will be apparent to those skilled in the art in light of the foregoing 
description. It is therefore contemplated that the appended claims will embrace any 
such alternatives, modifications and variations as falling within the true scope and 
spirit of the present invention. 

[0055] Thus, having described the invention, what is claimed is: 
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